Application of Reinforcement Learning to Batch Distillation
Abstract
A considerable body of work exists on the optimal operation and control of batch distillation, but it still rests on the assumption that an accurate process model is available. While this assumption is acceptable in theory, it remains a challenge in practical applications. Reinforcement Learning (RL) has already been recognised as a particularly suitable framework for optimising batch process operation; however, no application to batch distillation has been reported. This paper therefore presents RL as an automatic learning approach to batch distillation. The methodology is exemplified using various case studies.

INTRODUCTION
Distillation is one of the most widely used unit operations in the fine chemical, petroleum and pharmaceutical industries. It is one of the oldest methods for separating liquid mixtures into their components, and relies on differences in the boiling points and relative volatilities of those components. The rising importance of high-value-added, low-volume specialty chemicals has resulted in a renewed interest in batch processing technologies (Diwekar, 1995), and the drive for optimum operation is ever present. Batch distillation is an important and widely used separation process in the batch process industry. Its main advantage over continuous operation is that a single column can serve as a multi-purpose unit for separating mixtures into their pure components. Batch distillation can also handle a wide range of feed compositions with varying degrees of difficulty of separation (e.g. wide ranges of relative volatilities and product purities). Although it typically consumes more energy than continuous distillation, it provides more flexibility with less capital investment (Luyben, 1992). However, alongside this flexibility, batch distillation columns pose a range of challenging design and operational problems because of their inherently unsteady-state nature.
LITERATURE SURVEY
The main sequence of events in operating a batch distillation column starts with the feed being charged into the reboiler. The column is then operated at total reflux until it reaches steady state. This initial phase is known as the start-up phase. In the second phase, or production phase, the light component product is collected in a product tank until its average composition drops below a specified value. This cut is referred to as the main cut (the first main cut is sometimes preceded by taking off the low-boiling impurities at a high reflux ratio). After that, the first intermediate distillate fraction (off-cut or slop cut) is produced and stored in a different tank. This procedure is repeated with a second main cut and second slop cut, and so on, until the concentration of the heaviest component in the reboiler reaches a specified value. At the end of the batch, the column goes through a shutdown phase.

Slop cuts contain the distilled material that does not meet specification. Considerable work on slop handling strategies has been reported in the literature (Bonny et al., 1994; Mujtaba and Macchietto, 1992). A totally different operating policy is the cyclic operation of a batch distillation column. For a regular column, cyclic operation can be characterised by repeating a three-period operation (Sorensen, 1997): filling, total reflux, and dumping.

The main manipulated variable in controlling a batch distillation column is the reflux ratio. The conventional approach to controlling the column during the production of main cuts is to operate either at a constant reflux ratio or at a varying reflux ratio (constant distillate composition).
During operation at constant reflux ratio, the distillate composition is allowed to vary; this is the simpler strategy and hence the one more commonly used in industry. In the second approach, the reflux ratio is varied so as to maintain a fixed overhead composition. Both approaches are simple but give sub-optimal results.
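The two conventional policies, and the rule of collecting a main cut until its average composition falls below specification, can be illustrated with a small shortcut model. The sketch below is not the model used in the paper. It assumes a binary mixture with constant relative volatility, a total condenser, theoretical stages with negligible holdup, and constant molar overflow. All function names and numerical values (`alpha`, `n_stages`, `boilup`, the 0.95 specification) are hypothetical and chosen only for illustration.

```python
"""Shortcut sketch of the two conventional reflux policies (illustrative assumptions only)."""


def still_composition(x_d, reflux, n_stages, alpha):
    """Step down the column from the top; return the liquid mole fraction on the last stage (the still)."""
    y = x_d  # total condenser: vapour leaving the top stage has the distillate composition
    x = x_d
    for _ in range(n_stages):
        x = y / (alpha - (alpha - 1.0) * y)      # liquid in equilibrium with vapour y
        y = (reflux * x + x_d) / (reflux + 1.0)  # operating line: vapour rising from the stage below
    return x


def distillate_composition(x_b, reflux, n_stages, alpha):
    """Constant-reflux policy: bisect for the distillate composition matching still composition x_b."""
    lo, hi = x_b, 1.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if still_composition(mid, reflux, n_stages, alpha) < x_b:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def reflux_for_target(x_b, x_d_target, n_stages, alpha, r_max=1000.0):
    """Constant-composition policy: bisect for the reflux ratio that holds x_d_target at still composition x_b."""
    if still_composition(x_d_target, 0.0, n_stages, alpha) <= x_b:
        return 0.0  # target already met without reflux
    if still_composition(x_d_target, r_max, n_stages, alpha) > x_b:
        raise ValueError("target distillate purity not reachable below r_max")
    lo, hi = 0.0, r_max
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if still_composition(x_d_target, mid, n_stages, alpha) > x_b:
            lo = mid  # more reflux needed
        else:
            hi = mid
    return 0.5 * (lo + hi)


def collect_main_cut(x_b0, reflux, spec, n_stages=5, alpha=2.5,
                     charge=100.0, boilup=10.0, dt=0.02):
    """Euler-integrate a constant-reflux main cut; stop before the average product purity drops below spec."""
    still, x_b = charge, x_b0
    product, light = 0.0, 0.0           # total moles and light-component moles in the product tank
    d = boilup / (reflux + 1.0) * dt    # distillate withdrawn per time step
    while still > 0.05 * charge:
        x_d = distillate_composition(x_b, reflux, n_stages, alpha)
        new_product, new_light = product + d, light + d * x_d
        if new_light / new_product < spec:
            break                       # switch to the slop cut here
        x_b = (still * x_b - d * x_d) / (still - d)  # light-component balance on the still
        still -= d
        product, light = new_product, new_light
    average = light / product if product > 0.0 else 0.0
    return product, average, x_b


if __name__ == "__main__":
    for x_b in (0.5, 0.4, 0.3):
        x_d = distillate_composition(x_b, 3.0, 5, 2.5)
        r = reflux_for_target(x_b, 0.95, 5, 2.5)
        print(f"x_b={x_b:.2f}  x_d at R=3: {x_d:.4f}  R for x_d=0.95: {r:.3f}")
    print(collect_main_cut(0.5, 3.0, 0.95))
```

In this sketch, the constant-reflux policy lets the distillate purity fall as the still is depleted, whereas the constant-composition policy requires a steadily rising reflux ratio to hold the same purity. Neither rule optimises over the whole batch, which is consistent with the sub-optimality noted above.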